Literal Movement Grammar
   HOME

TheInfoList



OR:

In
linguistics Linguistics is the science, scientific study of human language. It is called a scientific study because it entails a comprehensive, systematic, objective, and precise analysis of all aspects of language, particularly its nature and structure ...
and
theoretical computer science computer science (TCS) is a subset of general computer science and mathematics that focuses on mathematical aspects of computer science such as the theory of computation, lambda calculus, and type theory. It is difficult to circumscribe the ...
, literal movement grammars (LMGs) are a grammar formalism intended to characterize certain extraposition phenomena of natural language such as
topicalization Topicalization is a mechanism of syntax that establishes an expression as the sentence or clause topic by having it appear at the front of the sentence or clause (as opposed to in a canonical position further to the right). This involves a phrasa ...
and cross-serial dependency. LMGs extend the class of
context free grammar In formal language theory, a context-free grammar (CFG) is a formal grammar whose production rules are of the form :A\ \to\ \alpha with A a ''single'' nonterminal symbol, and \alpha a string of terminals and/or nonterminals (\alpha can be empt ...
s (CFGs) by adding introducing pattern-matched function-like rewrite semantics, as well as the operations of variable binding and slash deletion. LMGs were introduced by A.V. Groenink in 1995.Groenink, Annius V. 1995. Literal Movement Grammars. In ''Proceedings of the 7th EACL Conference''.


Description

The basic rewrite operation of an LMG is very similar to that of a CFG, with the addition of
arguments An argument is a statement or group of statements called premises intended to determine the degree of truth or acceptability of another statement called conclusion. Arguments can be studied from three main perspectives: the logical, the dialectic ...
to the non-terminal symbols. Where a context-free rewrite rule obeys the general
schema The word schema comes from the Greek word ('), which means ''shape'', or more generally, ''plan''. The plural is ('). In English, both ''schemas'' and ''schemata'' are used as plural forms. Schema may refer to: Science and technology * SCHEMA ...
S \to \alpha for some non-terminal S and some string of terminals and/or non-terminals \alpha, an LMG rewrite rule obeys the general schema X(x_1, ..., x_n) \to \alpha, where X is a non-terminal with arity n (called a
predicate Predicate or predication may refer to: * Predicate (grammar), in linguistics * Predication (philosophy) * several closely related uses in mathematics and formal logic: **Predicate (mathematical logic) **Propositional function **Finitary relation, o ...
in LMG terminology), and \alpha is a string of "items", as defined below. The arguments x_i are strings of terminal symbols and/or variable symbols defining an argument pattern. In the case where an argument pattern has multiple adjacent variable symbols, the argument pattern will match any and all partitions of the actual value that unify. Thus, if the predicate is f(xy) and the actual pattern is f(ab), there are three valid matches: x = \epsilon,\ y = ab;\ x = a,\ y = b;\ x = ab,\ y = \epsilon. In this way, a single rule is actually a family of alternatives. An "item" in a literal movement grammar is one of * f(x_1, \ldots, x_n), a predicate of arity n, * x \textf(x_1, \ldots, x_n), a variable binding x to the string produced by f(x_1, ..., x_n), or * f(x_1, \ldots, x_n)/\alpha, a slash deletion of f(x_1, ..., x_n) by the string of terminals and/or variables \alpha. In a rule like f(x_1, ..., x_m) \to \alpha\ y \text g(z_1, ... z_n)\ \beta, the variable y is bound to whatever terminal string the g predicate produces, and in \alpha and \beta, all occurrences of y are replaced by that string, and \alpha and \beta are produced as if terminal string had always been there. An item x/y, where x is something that produces a terminal string (either a terminal string itself or some predicate), and y is a string of terminals and/or variables, is rewritten as the empty string (\epsilon) if and only if g(y_1, ..., y_n) = z, and otherwise cannot be rewritten at all.


Example

LMGs can characterize the non-CF language \ as follows: :S() \to x\textA()\ B(x) :A() \to a\ A() :A() \to \epsilon :B(xy) \to a/x\ b\ B(y) c :B(\epsilon) \to \epsilon The derivation for ', using parentheses also for grouping, is therefore S() \to x\textA()\ B(x) \to x\text(a\ A())\ B(x) \to x\text(aa\ A())\ B(x) \to x\textaa\ B(x) \to aa\ B(aa) : \to aa\ a/a\ b\ B(a)\ c \to aab\ B(a)\ c \to aab\ a/a\ b\ B()\ cc \to aabb\ B()\ cc\ \to aabbcc


Computational power

Languages generated by LMGs contain the context-free languages as a proper subset, as every CFG is an LMG where all predicates have arity 0 and no
production Production may refer to: Economics and business * Production (economics) * Production, the act of manufacturing goods * Production, in the outline of industrial organization, the act of making products (goods and services) * Production as a stati ...
rule contains variable bindings or slash deletions.


References

{{Formal languages and grammars Formal languages Grammar frameworks